PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.0248s0020.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 909aa    MW: 100385 Da    PI: 8.3399
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.0248s0020.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix66.65.1e-21817897186
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          +W+ +e+++ i++r e+++r++  k +++lWee+s++++++g++rsp qCk+ w +l ++y+++k+ e+++     +++p+f++++
  Cagra.0248s0020.1.p 817 KWKPEEIKKVIRMRGELHSRFQVVKGRMALWEEISSNLSAEGINRSPGQCKSLWASLVQKYEECKADERSK-----TSWPHFEDMN 897
                          7********************************************************************95.....36******97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:3.60.15.101.1E-66106356IPR001279Metallo-beta-lactamase
SuperFamilySSF562812.25E-72107521IPR001279Metallo-beta-lactamase
SMARTSM008492.6E-25119316IPR001279Metallo-beta-lactamase
PfamPF127064.0E-10131266IPR001279Metallo-beta-lactamase
PfamPF075215.5E-7461493IPR011108Zn-dependent metallo-hydrolase, RNA specificity domain
PROSITE profilePS500907.956810874IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.603.5E-5811876IPR009057Homeodomain-like
PfamPF138371.5E-17816898No hitNo description
CDDcd122031.19E-22816881No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009658Biological Processchloroplast organization
GO:0009942Biological Processlongitudinal axis specification
GO:0060918Biological Processauxin transport
GO:0009507Cellular Componentchloroplast
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 909 aa     Download sequence    Send to blast
MMKPASLQGF SSHVSSTIYS DVRRPATTPS KMAAFSALSL CPYTFTFRQS SRIKSTVSCS  60
VTSAPASGTS SSSKTPRRRR EGVGKSMEDS VKRKMEQFYE GTDGPPLRVL PIGGLGEIGM  120
NCMLVGNYDR YILIDAGIMF PDYDEPGVQK IMPDTGFIRR WKHKIEAVVI THGHEDHIGA  180
LPWVIPALDP NTPIFASSFT MELIKKRLKE HGIFVQSRLK TFSTRRRFMA GPFEIEPITV  240
THSIPDCSGL FLRCADGNIL HTGDWKIDEA PLDGKVFDRE ALEELSKEGV TLMMSDSTNV  300
LSPGRTISEK VVADALVRNV MAAKGRVITT QFASNIHRLG SIKAAADITG RKLVFVGMSL  360
RTYLEAAWRD GKAPIDPSSL VKVEDIEAYA PKELLIVTTG SQAEPRAALN LASYGSSHAF  420
KLTKEDIILY SAKVIPGNES RVMKMMNRLA DIGPNIIMGK NEMLHTSGHA YRGELEEVLK  480
IVKPQHFLPI HGELLFLKEH ELLGKSTGIR HTTVIKNGEM LGVSHLRNRR VLSNGFSSLG  540
RENLQLMYSD GDKAFGTSSE LCIDERLRIS SDGIIVLSME IMRPGVSENT LKGKIRITTR  600
CMWLDKGRLL DALHKAAHAA LSSCPVTCPL SHMERTVSEV LRKIVRKYSG KRPEVIAIAT  660
ENPMAVRADE VSARLSGDSS VGSGVAALRK VVDGHSKKSR PKKAPSQEDA PEEIDRTLED  720
DIIDSARLLA EEETAASTYT EEVKMPVGSS SEESDDFWKS FISPSSSPSP GETENVNKVT  780
GTEPKTEDKE SSRDDDNPSD TSDSETKPSS KRVRKNKWKP EEIKKVIRMR GELHSRFQVV  840
KGRMALWEEI SSNLSAEGIN RSPGQCKSLW ASLVQKYEEC KADERSKTSW PHFEDMNNIL  900
SELDTPAS*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5a0t_A3e-8810766119560RIBONUCLEASE J
5a0t_B3e-8810766119560RIBONUCLEASE J
5a0v_A3e-8810766119560RIBONUCLEASE J
5a0v_B3e-8810766119560RIBONUCLEASE J
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK2212720.0AK221272.1 Arabidopsis thaliana mRNA for putative protein, complete cds, clone: RAFL24-15-P14.
GenBankBT0042100.0BT004210.1 Arabidopsis thaliana clone RAFL16-01-C03 (R50172) unknown protein (At5g63420) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006279731.10.0hypothetical protein CARUB_v10027519mg
TrEMBLR0G7Y90.0R0G7Y9_9BRAS; Uncharacterized protein
STRINGAT5G63420.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM95392634
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G63420.10.0Trihelix family protein